Perfect hash function

A perfect hash function for a set S is a hash function that maps distinct elements in S to a set of integers, with no collisions. A perfect hash function has many of the same applications as other hash functions, but with the advantage that no collision resolution has to be implemented.

1 Properties and uses
2 Minimal perfect hash function
3 See also
4 References
5 Further reading
6 External links

Properties and uses

A perfect hash function for a specific set S that can be evaluated in constant time, and with values in a small range, can be found by a randomized algorithm in a number of operations that is proportional to the size of S. The minimal size of the description of a perfect hash function depends on the range of its function values: The smaller the range, the more space is required. Any perfect hash functions suitable for use with a hash table require at least a number of bits that is proportional to the size of S.

A perfect hash function with values in a limited range can be used for efficient lookup operations, by placing keys from S (or other associated values) in a table indexed by the output of the function. Using a perfect hash function is best in situations where there is a frequently queried large set, S, which is seldom updated. Efficient solutions to performing updates are known as dynamic perfect hashing, but these methods are relatively complicated to implement. A simple alternative to perfect hashing, which also allows dynamic updates, is cuckoo hashing.

Minimal perfect hash function

A minimal perfect hash function is a perfect hash function that maps n keys to n consecutive integers—usually [0..n−1] or [1..n]. A more formal way of expressing this is: Let j and k be elements of some finite set K. F is a minimal perfect hash function iff F(j) =F(k) implies j=k and there exists an integer a such that the range of F is a..a+|K|−1. It has been proved that a general purpose minimal perfect hash scheme requires at least 1.44 bits/key.^[1] However the smallest currently use around 2.5 bits/key.

A minimal perfect hash function F is order preserving if keys are given in some order a₁, a₂, ..., and for any keys a_j and a_k, j<k implies F(a_j)<F(a_k). Order-preserving minimal perfect hash functions require necessarily Ω(n log n) bits to be represented.

A minimal perfect hash function F is monotone if it preserves the lexicographical order of the keys. Monotone minimal perfect hash functions can be represented in very little space.

References

^ Djamal Belazzougui, Fabiano C. Botelho, Martin Dietzfelbinger (2009) (PDF). Hash, displace, and compress. Springer Berlin / Heidelberg. http://cmph.sourceforge.net/papers/esa09.pdf. Retrieved 2011-08-11.

External links

Minimal Perfect Hashing by Bob Jenkins
gperf is a Free software C and C++ perfect hash generator
cmph is Free Software implementing many perfect hashing methods
Sux4J is Free Software implementing perfect hashing, including monotone minimal perfect hashing in Java
MPHSharp is Free Software implementing many perfect hashing methods in C#

Perfect hash function

Contents

Properties and uses

Minimal perfect hash function

See also

References

Further reading

External links